An Adaptable Human Vision Model for Subjective Video Quality Rating Prediction among Cif, Sd, Hd and E-cinema
ثبت نشده
چکیده
A new highly adaptable model for predicting human vision response is presented for enabling an improved method of predicting subjective video quality. The ability to adapt enables comparison of video with dissimilar image sizes, viewing environments, frame rates, video quality classes, etc. (for example HD vs. SD vs. CIF). Model test results are compared with human response. Responses are from stimuli covering both JND (just noticeable differences) and supra-threshold (extending to near the opposite extreme). Supra-threshold responses compared include adaptation behavior exemplified by nonlinear response responsible for significant sensitivity changes, masking and visual illusions. Given the prediction of visible impairments, simulation of the contextual adaptation that occurs during the training portion of ITU-R BT.500 (subjective assessment methodology) is used for predicting DMOS. 1. 0 Introduction Previously developed models for predicting subjective video quality have one or both of the following: a) objective impairment measurements, each weighted by estimates of subjective annoyance, and summed to produce a single metric per frame and/or sequence [1] [2] and/or b) an attempt to at least partially mimic human vision system response [3] [4] [5] [6] [7] [8] [9]. However, vast stimulus-response data from vision science literature has remained largely unexploited. Models that have been developed within the vision science community typically only address isolated human vision system behaviors, or are parameterized as to not be readily adaptable to the application of video quality as exemplified by [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] [21] [22]. As a result, current standards and practices for subjective video quality measurement ignore substantial changes in human vision sensitivities. For example, ITU-T J.144 models such as VQM [1] do not take into account differences in viewing conditions, frame rates, resolutions or nominal quality context. And since J.144 was developed for standard definition (SD), using it for predicting subjective video quality ratings of other formats such as CIF or HD is problematic [23]. There have been reports that use of J.144 methods can be problematic even with some SD video if content and/or impairments are quite different from that used for development and verification of the algorithms. In addition to human vision perceptual response issues, display device differences and the nominal quality of each format also play important roles [23]. Even for a given format, display and viewing conditions, application dependent nominal quality range (for example, set by bit rate or program content nominal complexity) can vary significantly. This has made necessary a) normalizing subjects to the range of quality of video in ITU-R BT.500 [24] and b) the bit rate and frame rate matrix of categories of VQEG impairment ranges and types respectively [25]. As such, the context of nominal quality range affects the sensitivity and quality rating scales such as MOS and DMOS. Using the simulation approach shown in the system diagram of Figure 1, video quality rating prediction requires a human vision model that can adapt as the human vision system does, to different displays, viewing conditions, video context and formats, along with a method for adapting quality scales as humans do in subjective quality rating training and conditioning. Such a human vision model has been developed and patented [26] [27] [28] [29]. An overview of its specification, components, calibration and validation are covered in the remaining sections of this paper. Fig. 1. Simulation system diagram for subjective video quality rating prediction. 2.0 Defining Human Vision Model Response Specifications The specifications of the model may be formulated from the data available from thousands of published results of experiments in vision science. Stimulus-response pairs become the numeric framework for describing the input-output behavior desired from the human vision model. A good introductory overview of experiments and associated stimulusresponse sensitivity analysis is given in [30]. For brevity, the scope of the remaining discussion will be limited to achromatic light (luminance) only. Analysis for chromatic light follows a similar path.
منابع مشابه
Applications of Novel Video Compression Technology in Television
Digital technology is now widespread throughout television production, and opportunities for the application of novel video compression technologies are increasing throughout television production, distribution and delivery. The requirements for these applications are diverse, and such technologies will need to be flexible and easily integrated. The paper discusses five technology areas that th...
متن کاملQuality impact of video format and scaling in the context of IPTV
In this paper we investigate the influence of the video format and of re-scaling on the perceived video quality in the context of IPTV. Addressed resolutions are typical of IPTV: Standard Definition (SD, 720 x 576 pixels) and High Definition (HD, 1920x1080 pixels). Re-scaling comprises SD to HD (4:3, 1350x1080) upscaling and cropping from HD 16:9 (1920x1080) to HD 4:3 (1350x1080) format. In par...
متن کاملSS-SSIM and MS-SSIM for digital cinema applications
One of the key issues for a successful roll out of digital cinema is in the quality it offers. The most practical and least expensive way of measuring quality of multimedia content is through the use of objective metrics. In addition to the widely used objective quality metric peak signal-to-noise ratio (PSNR), recently other metrics such as single scale structural similarity (SS-SSIM) and mult...
متن کاملA Machine Learning Approach to No-Reference Objective Video Quality Assessment for High Definition Resources
The video quality assessment must be adapted to the human visual system, which is why researchers have performed subjective viewing experiments in order to obtain the conditions of encoding of video systems to provide the best quality to the user. The objective of this study is to assess the video quality using image features extraction without using reference video. RMSE values and processing ...
متن کاملA parametric model for perceptual video quality estimation
In this paper, a parametric model is proposed which provides estimation for the perceived quality of video, coded with different codecs, at any bit rate and display format. The video quality metric used is one of the standardized Full Reference models in Recommendations ITU-T J.144 and ITU-R BT.1683. The proposed model is based on the video quality estimation described in Recommendation ITU-T G...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010